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Abstract. Using the formalism of the biased random walk in random uncorrelated networks with arbitrary 
degree distributions, we develop theoretical approach to the critical packet generation rate in traffic based 
on routing strategy with local information. We explain microscopic origins of the transition from the flow 
to the jammed phase and discuss how the node neighbourhood topology affects the transport capacity in 
uncorrelated and correlated networks. 



Transport phenomena in real networked communica- 
tion systems, such as the Internet [TJ and WWW [2], has 
turned recently more and more attention in physical and 
computational science. Since the rapid development of so- 
ciety entails demands for high transport efficiency, scien- 
tists strive to develop methods for understanding and con- 
trolling traffic congestion on communication systems. 

In the basic models frequently used to mimic transport 
phenomena in communication networked systems 3 4 5 6 7, 
all nodes in a network are equally considered as hosts and 
routers for generating and delivering packets. Then, the 
traffic dynamics is defined as follows: 

— At each time step, there are R packets generated in 
the system, with randomly chosen sources and desti- 
nations. 

— During the next time steps packets travel around the 
network, and look for their destination-nodes. Once 
a packet arrives at its target, it is removed from the 
system. 

— To navigate packets, each node performs a local search 
among its neighbors. If the packet's destination is found 
within the search area, it is delivered directly to the 
target. Otherwise, the particle is forwarded to the next 
node according to the prescribed strategy. 

— At each time step every node can distribute/deliver 
at most C packets towards their destination (the fixed 
value of C reflects the limited router bandwidth). 

— The queue length of each node is assumed to be un- 
limited and the FIFO (first in first out) discipline is 
applied. 

In these models, one can distinguish between two kinds 
of strategies: 

— In the first kind, each node has the global topologi- 
cal information about the network, which allows pack- 
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ets to be forwarded either following the shortest path 
[3I4I8I9] or using the concept of betweenness |6ll0lllj , 
which measures the number of total shortest paths that 
pass through the given node. This kind of strategy may 
be practical for small or medium size networks, but 
not for very large networks in real communication sys- 
tems such as the Internet, WWW, peer-to-peer net- 
works 12. or urban transportation systems |13ll4j . 
— The strategies of the second kind base on local infor- 
mation (each node only knows its neighbourhood) and 
are favored in very large networks due to heavy com- 
munication cost of searching. 

One of the most important measurements for trans- 
port performance of a network is the traffic capacity, R Cl 
i.e. the critical packet generation rate. At R c , the network 
undergoes a phase transition from the free flow state to the 
congested state. When the packet generation rate R is be- 
low i? c , the number of generated and delivered packets are 
balanced and therefore the network is in free flow state. 
On the other hand, when R goes beyond R c , the num- 
ber of packets keeps on increasing with time and leads to 
congestion, simply because nodes cannot deliver too many 
packets at each time step due to limited delivering capac- 
ity. 

Although a number of empirical strategies for improv- 
ing transport efficiency has been proposed (see the review 
[Po] and references therein, and also [H]), the theoretical 
background of traffic congestion phenomena is not well de- 
veloped. With reference to this theoretical line of research 
we would like to highlight Ref . [1] , where the estimation of 
traffic capacity for the stategies of the first kind has been 
provided. 

In this paper, we present a theoretical approach to the 
critical packet generation rate applied to the strategy of 
the second kind proposed by Wang et al. [17] . The strategy 
is based on the biased random walk. In this strategy the 
next position of the packet (node j) is chosen according 
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to the prescribed preferential transition probability nii 



kf 



(1) 



where the sum in the denominator runs over neighbors 
of the node i, which represents the current position of 
the packet, and the exponent a is the model free parame- 
ter. Note that according to the formula ([1]) the transition 
probability from i to j depends only on the connectivity 
of the next-step node j. Note also that for a — we re- 
cover the ordinary unbiased random walk studied by Noh 
and Rieger [T8] . In the model, the so-called path iteration 
avoidance is assumed, which means that no link can be 
visited twice by the same packet. 

The main result of this model is presented in Fig. 1. 
The black squares (numerical simulations) show that the 
optimal performance of the system (the largest traffic ca- 
pacity) corresponds to a = — 1, which represents the anti- 
preferential transition probability Wij ~ l/kj. An interest- 
ing finding arises from the comparison of Fig. la and lb 
showing results for classical random graphs and networks 
with power law node degree distribution, respectively. In 
the former case, the function R c (a) has a smooth shape. 
In the later case the character of the relationship changes 
sharply. In what follows, we explain the observations with 
the help of a simple theoretical approach. 

Let us start with the simple observation: the congested 
phase occurs when the average number of packets N p (i, t, R) 
arriving at a certain node i at time t exceed its processing 
capacity C^. Thus the critical value of the posting rate R c 
should be somehow found form the balance condition 

d=N p (i,t,R c ), (2) 

which, for Cj = const, can be also rewritten as 

C = N p (i,t,R c ). (3) 

Note that in the free flow state one has C > N p (i,t,R) 
whereas in the congested phase there is C < N p (i,t,R). 
In the considered model, N p (i,t, R) can be written as 



N p (i,R) = PrN p (R), 



(4) 



where the stationary occupation probability, P°° , describes 
the probability that the particle is located at the node i in 
the infinite time limit (for that reason we have ommited 
time dependence of N p ) and N p is the total number of 



packets in the network. It has been shown [19] 
help of biased random walk formalism, that 



'„a+l 
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with the 



(5) 



where N is the network size. Note that for a = 0, which 
stands for the unbiased random walk, the stationary dis- 
tribution is, up to normalization, equal to the degree of 
the the node i, i.e. P°° ~ It means that the more links 
a node has, the more often it will be visited by a random 




Fig. 1. The critical packet generation rate R c calculated 
for different values of the parameter a with network size 
TV = 1000 and capacity C = 10 in a) classical random 
graphs and b) scale- free networks P(k) ~ /c~ 7 with the 
characteristic exponent 7 = 3. Relatively large value of 
average node degree ((fc) = 12 in case (a) and (k) = 9.3 in 
case (b)) ensures connectivity of the networks, i.e. there 
exists a path between each pair of nodes. Black squares 
represent results of numerical simulations, solid lines cor- 
respond to theoretical prediction of Eq. (7), while dashed 
lines have been calculated from Eq. (7) with P°° replaced 
byP£. 



walker. Note also that for a = — 1, the stationary occupa- 
tion probability is uniform P°° = 1/N. The same scaling 
behavior as given by Eq. (JSJ was found in Ref. [T7] for 
the number of packets moving simultaneously on BA net- 
works [2U] in the free flow state. It means that in the free 
flow, and also in the critical point which is the limiting 
case of free flow state, the packets may be considered as 
non-interacting particles (i.e. independent biased random 
walkers). 

The last observation allows us to calculate the total 
number of packets distributed over the whole network N p 
in the free flow state as: 



N p = R(Ti- 



(6) 



where (Ty) stands for the mean first-passage time aver- 
aged over all pairs of nodes. (Ty) can be understood as 
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the mean lifetime of a packet and can be calculated theo- 
retically from Eq. (19) in [19]. 

Now, combining Eqs. and (JSJ) one can find the 

critical value of packets generation rate R c : 

Rc = WWY (7) 

The last equation shows that the critical value of pack- 
ets generation rate is, due to P°°, a function of a node 
degree ki . It means that in heterogeneous networks nodes 
of different degrees become congested for different values 
of R. It also means that although the system as a whole 
enters the jammed state even if one node is congested, it 
still possesses partial capacity for forwarding packets in 
this phase. 

To find the critical value of R c , which reflects simu- 
lation results presented in Fig. 1, from the whole set of 
different degree dependent values of R c one has to choose 
the smallest one. Since R c is inversely proportional to Pf , 
choosing minimal value of R c corresponds to taking nodes 
with the highest P°°. In Fig. 2, P?° is presented as a func- 
tion of a node degree k for different values of the parameter 
a and for two different network topologies. Solid lines cor- 
respond to theoretical prediction of Eq. ((SJ) whereas black 
squares are results of numerical simulations. In both kinds 
of networks for a < —1, P°° is maximal for nodes with 
the smallest degrees and in this range of the parameter 
a these nodes become congested first. For a > — 1 the 
situation changes and congestion starts in nodes with the 
highest degeees. 

Solid lines shown in Fig. 1 represent theoretical esti- 
mation of R c (a) as given by Eq. ([7]). The lines have been 
calculated taking into account P°°(ki = k m i n ) for a < — 1 
and P°°(ki = k max ) for a > — 1. The mean lifetime of 
a packet (Ty) in Eq. flTJ has been calculated numerically 
because its theoretical estimation given by averaging Eq. 
(19) in Ref. [TO] due to applied approximation (see Eq. 
(16) in Ref. [TI5]) gives correctly only the order of magni- 
tude of (T i:j ). 

Although the character of theoretical lines in Fig. 1 
reflects the shape of numerical results and confirms the 
numerically found optimal performance of the system for 
a = — 1, it is still far from perfection. The discrepancies 
are caused by the fact, that Eq. ([5]) describes only the 
average occupation of a node with a given degree k. In 
reality, P°°(k) may differ among the nodes of the same 
degree. In general, the differences depend on the topology 
of a node neighbourhood. Therefore one can write formally 

pr(k) = p%(k) = <pt? r (k)), (8) 

where p°° r (k) is a stationary occupation probability for a 
node with particular neighbourhood topology r and the 
average is calculated over all possible such topologies. The 
black square symbol in Eq. © has been used to reflect the 
fact that we are talking about results marked by black 
squares in Fig. 2. 

The triangles (circles) in Fig. 2 represent stationary oc- 
cupation probability calculated for the most (the least) fre- 
quently occupated node among all nodes with the same 
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Fig. 2. Stationary occupation probabilities calculated for 
different node degrees k and different values of parameter 
a in classical random graphs (a)-(d) and scale free net- 
works (e)-(h). Solid lines correspond to theoretical pre- 
diction of Eq. (O. Black squares, triangles and circles are 
the results of numerical simulations of P^(k), P°°(k) and 
P.°^(k), respectively. 

degree (the results are averaged over 100 network realiza- 
tions to get rid of fluctuations) 

P%(k)=mzx{p°°(k)}, 
P£(k)=min{p?(k)}. (9) 

The explained above scenario of congestion means that 
the nodes which become congested first are those repre- 
sented by triangles rather than by black squares. This ob- 
servation allows to understand the difference in shape of 
R c (a) between classical random graphs and networks with 
power law node degree distribution (cf. Fig. 1). In the for- 
mer case, along with the increase of the parameter a the 
initial congestion affects nodes with gradually larger and 
larger degrees. In the later case of scale free networks large 
negative values of a correspond to congrestion of the nodes 
with the lowest degrees. Nearby a w —0.9 (not a — — 1 as 
was estimated in previous studies [17] ) situation changes 
suddenly and the highly connected nodes become respon- 
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sible for the congestion (without the intermediate partic- 
ipation of nodes with middle degrees) . The dashed line in 
Fig. 1 calculated from Eq. ([7]) with P°° replaced by the 
discussed above maximal stationary occupation probabil- 
ity shows excellent agreement with numerical simulations. 

As we have stated before, stationary occupation prob- 
ability P°° calculated for nodes of the same degree is in 
fact the average over all possible topologies of a node 
neighbourhood (cf. Eq. ©). To show the impact of neigh- 
bourhood topology on the probability we have calculated 
the average degree of the nearest neighbor, k nn , for three 
groups of nodes: those marked in the Fig. 2 by squares 
(all the nodes), triangles (the most frequently occupated 
nodes) and circles (the least frequently occupated nodes). 
The results for SF networks and for a = — 1 (correspond- 
ing to the scenario of unbiased random walk shown in Fig. 
2f) are presented in Fig. 3a. Although, in the case of un- 
correlated networks, k nn calculated for all nodes does not 
depend on k (what is confirmed by horizontal character 
of black squares), the Fig. 3a shows that the neighbour- 
hood of the nodes which are responsible for congestion (i.e. 
in case of a = —1 the most frequently occupated nodes 



a) 



with k 



is composed of weakly connected nodes. 



Then one can suggest that increasing their k nn (i.e. mak- 
ing their neighbourhood disassortatively correlated) may 
improve transport capacity of the network. 

To check the hypothesis one has to introduce degree 
correlations to the network. From many methods for gen- 
erating correlated networks (e.g. 21:22]) we select one of 
the simplest model proposed by Noh [23124] . The model 
belongs to a class of the exponential random graph fam- 
ily |25l26j . In this class, a network model is defined as a 
Gibbsian ensemble of networks with an associated network 
Hamiltonian 



H(G) 



J N 

— ciijkikj , 



(10) 



where J is a control parameter and , element of the ad- 
jacency matrix, takes the value = 1 or if nodes i and 
j are connected or not. A positive (negative) correlation 
is favored by a positive (negative) value of J. The Monte 
Carlo dynamics is based on updating network configura- 
tions via the link rewiring process, which preserves the 
degree of each node. The dynamics leads to the Gibbsian 
ensemble in the stationary state. 

The assortativity of the network [21] is measured by 
the Pearson correlation coefficient of the degrees at either 
ends of an edge: 



(fcfcQ,-((fc + fcQ/2)? 
((k 2 + k'^)/2)i - ((k + k')/2)j 



(11) 



where (•); denotes the average over all links, whereas k 
and k' represent the degrees of two nodes at either end 
of links. The sign of r indicates a positive (assortative) 
or negative (disassortative) degree correlation. It vanishes 
for uncorrelated networks. 

The stationary state value of the assortativity for SF 
network is presented in Fig. 4. The possible values of r 
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Fig. 3. The average degree of the nearest neighbor for all 
the nodes (squares) , the most frequently occupated nodes 
(triangles) and the least frequently occupated nodes (cir- 
cles) for r = 0.00 (a) and r = —0.20 (b). Stationary occu- 
pation probabilities calculated for different node degrees 
k for a = — 1 in SF networks (c). 
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Fig. 4. Stationary state values of the assortativity as a 
function of J in SF network. 



in correlated SF networks, which can be obtained with 
the help of thi model belong to the range —0.27 < r < 
0.27. This differentiates SF networks from networks with 
the Poisson degree distribution for which — 1 < r < 1 
(cf. Fig. 2b in [23 ). It seems that the possible patterns 
of correlation in scale-free networks are restricted by the 
power law degree distribution. 

Having the tool for generating correlated networks in 
hands, one can repeat the calculations of k nn for such 
networks. In Fig. 3b we have shown results for the case 



of SF networks with r 



-0.20. In such networks k nn for 



the nodes responsible for congestion has increased from 
about 5.9 (in uncorrelated networks) to about 7.3. Unfor- 
tunately, further increasing of negative correlations will 
have an undesirable effect on the network capacity: de- 
taching weakly connected nodes from other weakly con- 
nected nodes one have to attach them to the highly con- 
nected ones. As a consequence, the transport abbilities of 
hubs decrease. In the Fig. 3c we have shown corresponding 
occupation probabilities for that case. As one can see, P?° 
for the most frequently occupated nodes with k m i n and 
kmax equalize (what is emphasized by the horizontal line). 
Therefore, the case of r = —0.20 is optimal, because any 
change of network correlations can only increase the maxi- 
mal occupation probability (through increasing P°£(k m i n ) 

Finally, in the Fig. 5 we have shown the profiles of 
the critical packet generation rate R c calculated for three 
different values of the parameter r: in the case of uncor- 
related network (r = 0.00), the case of optimally corre- 
lated network (r = —0.20), and the case of highly cor- 
related network (r = —0.27). The interesting observation 
is that although in the most optimal case, k nn calculated 
for the nodes responsible for congestion has increased just 
about one degree (cf. Fig. 3b), in the same time the criti- 
cal packet generation rate R c (a = —1) has increased from 
29 to 41, i.e. about 40%. 

The observation, that transport capacity of a network 
can be enhanced for both assortative mixing and disassor- 
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Fig. 5. Profiles of the critical packet generation rate R c 
calculated for the case of uncorrelated network (r — 0.00) 
(black squares) , the case of optimally correlated network 
(7- = —0.20) (open squares), and the case of highly corre- 
lated network (r = —0.27) (open circles). 



tative mixing has been recently reported in Ref. [27] . In 
the metioned paper, authors have studied routing strate- 
gies with the global topological information. In our case, 
assorative mixing cannot improve the network capacity, 
since it only decrease k nn of nodes responsible for conges- 
tion. 

In summary, using the formalism of the biased ran- 
dom walk in random uncorrelated networks with arbitrary 
degree distributions, we have developed the theoretical 
approach to the critical packet generation rate in traf- 
fic dynamics with the local routing strategy as proposed 
by Wang et al. We have shown that the random walk ap- 
proach can be used to give microscopic explanation of the 
phase transition from free flow to the jammed phase. We 
have also discussed the effect of degree correlations and 
node neighbourhood topology on the properties of trans- 
port in complex networks. 
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